Automata Guided Hierarchical Reinforcement Learning for Zero-shot Skill Composition

نویسندگان

Xiao Li

Yao Ma

Calin Belta

چکیده

An obstacle that prevents the wide adoption of (deep) reinforcement learning (RL) in control systems is its need for a large amount of interactions with the environment in order to master a skill. The learned skill usually generalizes poorly across domains and re-training is often necessary when presented with a new task. We present a framework that combines methods in formal methods with hierarchical reinforcement learning (HRL). The set of techniques we provide allows for convenient specification of tasks with complex logic, learn hierarchical policies (meta-controller and low-level controllers) with well-defined intrinsic rewards using any RL methods and is able to construct new skills from existing ones without additional learning. We evaluate the proposed methods in a simple grid world simulation as well as simulation on a Baxter robot.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pursuit Reinforcement Competitive Learning: PRCL based Online Clustering with Learning Automata

A new online clustering method based on not only reinforcement and competitive learning but also pursuit algorithm (Pursuit Reinforcement Competitive Learning: PRCL) as well as learning automata is proposed for reaching a relatively stable clustering solution in comparatively short time duration. UCI repository data which are widely used for evaluation of clustering performance in usual is used...

متن کامل

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generaliz...

متن کامل

Pursuit Reinforcement Competitive Learning: PRCL based Online Clustering with Tracking Algorithm and its Application to Image Retrieval

Pursuit Reinforcement guided Competitive Learning: PRCL based on relatively fast online clustering that allows grouping the data in concern into several clusters when the number of data and distribution of data are varied of reinforcement guided competitive learning is proposed. One of applications of the proposed method is image portion retrievals from the relatively large scale of the images ...

متن کامل

Signals Reinforcement Inputs Sensory Actions Skill Skill Skill

While the need for hierarchies within control systems is apparent, it is also clear to many researchers that such hierarchies should be learned. Learning both the structure and the component behaviors is a diicult task. The beneet of learning the hierarchical structures of behaviors is that the decomposition of the control structure into smaller transportable chunks allows previously learned kn...

متن کامل

A new Evolutionary Reinforcement Scheme for Stochastic Learning Automata

A stochastic automaton can perform a finite number of actions in a random environment. When a specific action is performed, the environment responds by producing an environment output that is stochastically related to the action. The aim is to design an automaton, using an evolutionary reinforcement scheme (the basis of the learning process), that can determine the best action guided by past ac...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1711.00129 شماره

صفحات -

تاریخ انتشار 2017

Automata Guided Hierarchical Reinforcement Learning for Zero-shot Skill Composition

نویسندگان

چکیده

منابع مشابه

Pursuit Reinforcement Competitive Learning: PRCL based Online Clustering with Learning Automata

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Pursuit Reinforcement Competitive Learning: PRCL based Online Clustering with Tracking Algorithm and its Application to Image Retrieval

Signals Reinforcement Inputs Sensory Actions Skill Skill Skill

A new Evolutionary Reinforcement Scheme for Stochastic Learning Automata

عنوان ژورنال:

اشتراک گذاری